Asymptotically Efficient Adaptive Strategies in Repeated Games Part II. Asymptotic Optimality
نویسندگان
چکیده
Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at . http://www.jstor.org/page/info/about/policies/terms.jsp. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you may use content in the JSTOR archive only for your personal, non-commercial use.
منابع مشابه
Asymptotically Efficient Adaptive Strategies in Repeated Games Part I: Certainty Equivalence Strategies
This paper addresses the problem of dynamic decision making in an uncertain and competitive environment. A decision maker (player 1) faces a system about which he has some (parametric) uncertainty, and which is affected also by the actions of other agents. We focus on a worst-case analysis from the viewpoint of player 1, using the simplified model of a repeated matrix game with lack of informat...
متن کاملAsymptotically Efficient Adaptive Strategies in Repeated Games. Part I: Certainty Equivalence
Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at . http://www.jstor.org/page/info/about/policies/terms.jsp. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you may use content in the JSTOR archive...
متن کاملSubsolutions of an Isaacs Equation and Efficient Schemes for Importance Sampling
It was established in [6, 7] that importance sampling algorithms for estimating rare-event probabilities are intimately connected with two-person zero-sum differential games and the associated Isaacs equation. This game interpretation shows that dynamic or state-dependent schemes are needed in order to attain asymptotic optimality in a general setting. The purpose of the present paper is to sho...
متن کاملFinite Time Analysis of Optimal Adaptive Policies for Linear-Quadratic Systems
We consider the classical problem of control of linear systems with quadratic cost. When the true system dynamics are unknown, an adaptive policy is required for learning the model parameters and planning a control policy simultaneously. Addressing this trade-off between accurate estimation and good control represents the main challenge in the area of adaptive control. Another important issue i...
متن کاملAsymptotic Optimality of the Bayes Estimator on Differentiable in Quadratic Mean Models
This paper deals with the study of the Bayes estimator’s asymptotic properties on Differentiable in Quadratic Mean (DQM) models in the case of independent and identically distributed observations. The investigation is led in order to define weak assumptions on the model under which this estimator is asymptotically efficient, regular and asymptotically of minimal risk. The results of the paper a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Math. Oper. Res.
دوره 21 شماره
صفحات -
تاریخ انتشار 1996